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Abstract 

We consider the Erdos-Renyi random graph G{n,p) inside the critical window, that is when p = 
l/n + An-^/^ for some fixed A G M. Then, as a metric space with the graph distance rescaled by n ^ , 
the sequence of connected components G{n,p) converges towards a sequence of continuous compact 
metric spaces. The result relies on a bijection between graphs and certain marked random walks, and 
the theory of continuum random trees. Our result gives access to the answers to a great many questions 
about distances in critical random graphs. In particular, we deduce that the diameter of G{n,p) rescaled 
by n~^^^ converges in distribution to an absolutely continuous random variable with finite mean. 
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1 Introduction 

Random graphs and the phase transition 

Since its introduction by Erdos and Renyi [20], the model G(n,p) of random graphs has received an enormous 
amount of attention [11, 28]. In this model, a graph on n labeled vertices {1,2, ... ,n} is chosen randomly 
by joining any two vertices by an edge with probability independently for different pairs of vertices. This 
model exhibits a radical change in structure (or phase transition) for large n when p = p{n) ~ 1/n. For 
p ~ c/n with c < 1, the largest connected component has size (number of vertices) O(logn). On the other 
hand, when c > 1, there is a connected component containing a positive proportion of the vertices (the giant 
component). The cases c < 1 and c > 1 are called subcritical and supercritical respectively. This phase 
transition was discovered by Erdos and Renyi in their seminal paper [20]; indeed, they further observed that 
in the critical case, when p = 1/n, the largest components of G{n^p) have sizes of order n^/^. For this 
reason, the phase transition in random graphs is sometimes dubbed the double jump. 

Understanding the critical random graph (whenp = p{n) ^ 1/n) requires a different and finer scaling: the 
natural parameterization turns out to be of the form p = p{n) = 1/n + An~^/^, for A = o(n^/^) [12, 35, 38]. 
In this paper, we will restrict our attention to A G M; this parameter range is then usually called the critical 
window. One of the most significant results about random graphs in the critical regime was proved by Aldous 
[7] . He observed that one could encode various aspects of the structure of the random graph (specifically, the 
sizes and surpluses of the components) using stochastic processes. His insight was that standard limit theory 
for such processes could then be used to get at the relevant limiting quantities, which could, moreover, be 
analyzed using powerful stochastic-process tools. Fix A G M, set p = 1/n + An~^/^ and write Z"^ and for 
the size and surplus (that is, the number of edges which would need to be removed in order to obtain a tree) 
of , the i-th largest component of G{n,p). Set = (Zf , Z^^, . . . ) and = (6'^ 6'^^, . . . ). 
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Theorem 1 (Aldous [7]). As n ^ oo. 

(n-2/^Z^,S") 4 (Z,S). 

Here, the convergence of the first co-ordinate takes place in the set of infinite sequences (xi, X2, . . . ) 
with xi > X2 > ■ - ■ > and Xl^>i < oo. (See also [27, 38].) The limit (Z, S) is described in terms of a 
Brownian motion with parabolic drift, {W^{t),t > 0), where 

W\t) :=Wit)+tX-*^, 

and {W{t)^t > 0) is a standard Brownian motion. The limit Z has the distribution of the ordered sequence 
of lengths of excursions of the reflected process W^{t) — mino<s<^ W^{s) above 0, while S is the sequence 
of numbers of points of a Poisson point process with rate one in x R+ lying under the corresponding 
excursions. Aldous's limiting picture has since been extended to "immigration" models of random graphs [8], 
hypergraphs [25], and most recently to random regular graphs with fixed degree [43]. 

The purpose of this paper is to give a precise description of the limit of the sequence of components 
Qn _ (^Qn^Qn^ _ )^ Here, we view C]^,C2 , • • • as metric spaces , M2^, . . ., where the metric is the usual 
graph distance, which we rescale by n~^/^. The limit object is then a sequence of compact metric spaces 
M = (Ml, M2, . . . ). The appropriate topology for our convergence result is that generated by the Gromov- 
Hausdorff distance on the set of compact metric spaces, which we now define. Firstly, for a metric space 
(M, J), write dn for the Hausdorff distance between two compact subsets K^K' of M, that is 

dH{K, K') = inf{e >0:K C F,{K') and K' C F,{K)}, 

where F^{K) := {x G M : 5{x^ K) < e} is the e-fattening of the set K. Suppose now that X and X' are two 
compact metric spaces, each "rooted" at a distinguished point, called p and p' respectively. Then we define 
the Gromov-Hausdorff distance between X and X' to be 

dGH{X, X') = mi{dH{^{X),^\X')) V 5{^{p), ^{p'))} 

where the infimum is taken over all choices of metric space (M, 5) and all isometric embeddings (j) : X ^ M 
and (j)' : X' M. (Throughout the paper, when viewing a connected labeled graph G as a metric space, we 
will consider G to be rooted at its vertex of smallest label.) The main result of the paper is the following 
theorem. 

Theorem 2. As n ^ 00, 

(n-2/3z^,n-^/^M^) 4 (Z,M), 

for an appropriate limiting sequence of metric spaces M = (Mi,M2,...). Convergence in the second co- 
ordinate here is in the metric specified by 

/oo 

d(A,B)= \ \^^dGu{A,,B,f\ (1) 

for any sequences of metric spaces A = (^1,^2, . . .) and B = (5i, ^2, • • •)• 

We will eventually state and prove a more precise version of this theorem. Theorem 25, once we have 
introduced the appropriate limiting sequence. For the moment, we will remark only that convergence in the 
distance defined above implies convergence of distances in the graph. As a result, we will be able to give a 
precise answer to a great many asymptotic distance problems for G(n,p) inside the critical window. Before 
we can give an intuitive description of our limit object, we need to introduce one of its fundamental building 
blocks: the continuum random tree. 
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Figure 1. Left: a tree on [9]. Right: the same tree but labeled in depth- first order. 
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Figure 2. The height process and contour process associated with the tree in Figure 1. 



The continuum random tree 

In recent years, a huge literature has grown up around the notion of real trees. Here we will concentrate on 
the most famous random example of such trees, Aldous' Brownian continuum random tree (see [4, 5, 6]), 
and encourage the interested reader to look at [18, 21, 33] and the references therein for more general cases. 

The fundamental idea is that continuous functions can be used to encode tree structures. We will begin 
our discussion by considering a rooted combinatorial tree on n vertices labeled by [n] := {1, 2 . . . , n}. There 
are three (somewhat different) encodings of such a tree which will be useful to us. We will introduce two 
of them here and explain the third, which plays a more technical role, in the main body of the paper (see 
Section 2). For both of the encodings we will discuss here, we need to introduce the notion of the depth-first 
ordering of the vertices. For each vertex there is a unique path from v to the root, p. Call the vertices 
along this path the ancestors of v. Relabel each vertex by the string which consists of the concatenation of 
all the labels of its ancestors and its own label, so that if the path from p to v is p, ai, a2, . . . , relabel 
V by the string paia2 . . .amV. The depth-first ordering of the vertices is then precisely the lexicographical 
ordering on these strings. More intuitively, we look first at the root, then at its lowest-labeled child, then at 
the lowest-labeled child of that vertex, and so on until we hit a leaf. Then we backtrack one generation and 
look at the next lowest-labeled child and its descendents as before. See Figure 1. 

The first encoding is in terms of the height function (or, when the tree is random, the height process). 
For < z < n — 1, let H{i) be the graph distance from the root of the {i + l)-st vertex visited in depth- first 
order (so that H{0) = 0, since we always start from the root). Then the height function of the tree is the 
discrete function (i^(z),0 < i < n — 1). (See Figure 2.) (It is, perhaps, unfortunate that we talk about a 
depth-fi.Tst ordering and vertices having heights. The reason for this is that the two pieces of terminology 
originated in different communities. However, since both are now standard, we have chosen to keep them 
and hope that the reader will forgive the ensuing clumsiness.) 
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The second encoding is via the contour function (or, when the tree is random, the contour process). 
Starting at the root, trace the contour of the tree in depth- first order at speed 1 and record the current 
distance from the root, C{t) at time t. That is, instead of just hopping from vertex to vertex, take the 
shortest tree path between them at speed 1. (See Figure 2.) It is easy to see that every edge is now traversed 
twice, and so this produces a function (C(t), < t < 2{n — 1)). 

Note that the topology of the tree, but not the labels, can be recovered from either the height or contour 
function. 

Suppose now that we take a uniform random tree on [n], rooted at 1. Let {H^{i),0 < z < n — 1) be its 
height process and (C^(t),0 <t< 2{n — 1)) its contour process. 

Theorem 3 (Aldous [6]). Let (e(t),0 <t <1) he a standard Brownian excursion. Then, as n ^ oo, 
^(i^^(|nt|),0 < t < 1) 4 2(e(t),0 < t < 1) and ^(C^(nt),0 < t < 2) ^ 2(e(t/2),0 < t < 2). 

Here, convergence is in the space D([0,1],M+) of non-negative cadlag functions (right-continuous with 
left limits), equipped with the Skorohod topology (see, for example, Billingsley [10]). 

In fact, this convergence turns out to imply that the tree itself converges, in a sense which we will now 
make precise. We follow the exposition of Le Gall [33]. Take a uniform random tree on [n] and view it as 
a path metric space by taking the union of the line segments joining the vertices, each assumed to have 
length 1. (Note that the original tree-labels no longer play any role, except that we will think of the metric 
space Tn as being rooted at the point corresponding to the old label 1.) Then the distance between two 
elements a and a' of Tn is simply the length of the shortest path between them. We will abuse notation 
somewhat and write n~^/^T^ for the same metric space with all distances rescaled by n~^/^. 

In order to state the convergence result, we need to specify the limit object. We will start with some 
general definitions. 

A compact metric space (T, d) is a real tree if for oil x^y G T 

• there exists a unique geodesic from x to y i.e. there exists a unique isometry f^^y : [0, d{x, y)] T such 
that fx,y{0) = X and fx,y{d{x,y)) = y. The image of f^^y is called [[x,y]]] 

• the only non-self-intersecting path from x to y is [[x^y]] i.e. if q : [0^1] ^ T is continuous and injective 
and such that q{0) = x and q{l) = y then g([0, 1]) = [[x, y]]. 

An element x G T is called a vertex. A rooted real tree is a real tree (T, d) with a distinguished vertex p 
called the root. The height of a vertex v is d{p^v). By a leaf we mean a vertex v which does not belong to 
[[/), iL']] for any w v. Write C{T) for the set of leaves of T. Finally, write [[x,^[[ for fx^y{[0, d{x,y))). 

Suppose that h : [0, oo) [0, oo) is a continuous function of compact support such that h{0) = 0. Use it 
to define a pseudo-metric d by 

d{x ^ y) = h{x) -\- h{y) — 2 inf h{t)^ x,?/G[0, oo). 

x/\y<t<x\/y 

Let X ^ y if d{x^ y) = 0, so that ~ is an equivalence relation. Let T = [0, oo)/ ~ and denote by r : [0, oo) ^ T 
the canonical projection. If a is the supremum of the support of h then note that r(s) = for all 5 > a. 
This entails that T = r([0, a]) is compact. The metric space (T, d) can then be shown to be a real tree. Set 
p = r(0) and take p to be the root. 
Now take 

'2e{t) 0<t<l, 
t>l. 



h{t) 
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where, as in Theorem 3, and for the rest of the paper, (e(t),0 < t < 1) is a standard Brownian excursion. 
Then the resulting tree is the Brownian continuum random tree (or CRT, when this is unambiguous). Note 
that the role of the Brownian excursion (multiplied by 2) is that of both height and contour process. We 
will always think of the CRT as rooted. We then have the following. 

Theorem 4 (Aldous [6], Le Gall [34]). Let be the metric space corresponding to a uniform random tree 
on [n] and let T be the CRT. Then 

as n ^ oo, where convergence is in the Gromov-Hausdorff sense. 

It is perhaps useful to note here that the limit tree T comes equipped with a mass measure /i, which is 
simply the probability measure induced on T from Lebesgue measure on [0, 1]. Unsurprisingly, ja is the limit 
of the empirical measure on the uniform tree on [n] which puts mass 1/n on each vertex. Later on, we will 
use the fact that ii{C(T)) = 1, i.e., /i is concentrated on the leaves of the CRT [5, p. 60]. 

The limit of a critical random graph 

In this section we give a non-technical description of the limiting object in Theorem 2. Conditional on their 
size and surplus, components of G(n,p) are uniform connected graphs with that size and surplus. Moreover, 
as we have discussed, in the critical window, where p = + An~^/^ for some A G M, the largest components 
have size of order n^/^ and surplus of constant order. In order to understand better the structure of these 
components, we look at uniform connected graphs with "small" surplus. For definiteness, we will consider a 
uniform connected graph on m vertices with surplus s. 

Such connected graphs always possess spanning subtrees. A particular one of these, which we will refer to 
as the depth-first tree, will be very useful to us. As its name suggests, this tree is constructed via a depth-first 
search procedure (which we will not detail until later). The depth- first tree of a uniform random connected 
graph with s surplus edges is not a uniform random tree, but has a "tilted" distribution which is biased (in 
a way depending on s) in favor of trees with large area (for "typical" trees, this is essentially the sum of the 
heights of the vertices of the tree). We will define this tilting precisely later, and will then spend much of 
the paper studying it. 

The limit of a uniform random tree on m vertices, thought of as a metric space with graph distances 
rescaled by m~^/^, is the continuum random tree. It turns out that the limit of the depth- first tree associated 
with a connected component with surplus 5, with the same rescaling, is a continuum random tree coded by 
a Brownian excursion whose distribution is biased in favor of excursions having large area (where area now 
has its habitual meaning; once again, the bias depends on s). 

The difference between the depth- first tree and the connected graph is precisely the s surplus edges. The 
depth- first tree is convenient because not only can we describe its continuum limit but, given the tree, it is 
straightforward to describe where surplus edges may go (we call such locations permitted edges). Indeed, the 
surplus edges are equally likely to be any of the possible 5-sets of permitted edges. A careful analysis of the 
locations of the surplus edges in the finite graph leads to the following surprisingly simple limit description 
for a uniform random connected graph with surplus s. Take a continuum random tree with tilted distribution 
and independently select s of its leaves with density proportional to their height. For each of these leaves 
there is a unique path to the root of the tree. Independently for each of the s leaves, pick a point uniformly 
along the path and identify the leaf and the selected point. (Note that we identify the points because 
edge-lengths have shrunk to in the limit.) 

Having thus described the limit of a single component of a critical random graph, it remains to describe 
the limit of the collection of components. The key is Aldous' description of the limiting sizes and surpluses 
of the components in terms of the excursions above of the reflected Brownian motion with parabolic drift. 
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The excursion lengths give the hmiting component sizes. The auxilhary Poisson process of points with unit 
intensity under the graph of the reflected process gives the limit of the numbers of surplus edges. In fact, 
more is true. The excursions themselves can be viewed as coding the sequence of limits of depth- first trees 
of the components; the locations of the Poisson points under the excursions can be seen to correspond in a 
natural way to the locations of the surplus edges. Intuitively, the successive excursions are selected according 
to an inhomogeneous excursion measure associated with the process. Under this measure, the length and 
area of an excursion are related in precisely the correct "tilted" manner, so that, conditional on an excursion 
having length a and s Poisson points, the metric space it codes has precisely the distribution of the limit of 
a uniform random connected graph on ~ an?^^ vertices with s surplus edges, whose edge-lengths have been 
rescaled by n~^/^. 

The diameter of random graphs 

The diameter of a connected graph is the largest distance between any pair of vertices of the graph. For a 
general graph G, we define the diameter of G to be the greatest distance between any pair of vertices lying 
in the same connected component. 

The behavior of the diameter of G(n,p) for p = 0(l/n) is a pernicious problem for which few detailed 
results were known until extremely recently [47]. (For references on distances in dense graphs G{n^p) with 
p fixed, see [11].) In the subcritical phase, when p — p{n) = 1/n + A(n)n~^/^ and A — oo, Luczak 
[37] showed that the diameter of G{n,p) is within one of the largest diameter of a tree component with 
probability tending to one. Chung and Lu [14] focused on the early supercritical phase, when np > 1 and 
np < clogn. (Problem 4 in their paper asks about the diameter of G{n^p) inside the critical window.) 
More recently, Riordan and Wormald [47] have addressed the problem for the range p = c/n, c > 1 fixed, 
proving essentially best possible bounds on the behavior of the diameter for such p. They also have results 
for p = 1/n + An~^/^, with A = o(n^/^) which are also essentially optimal, but their error terms require that 
A = A(n) > e^^°^ ^ ^ oo as n ^ oo. When A is fixed, G(n,p) contains several complex (i.e., with multiple 
cycles) components of comparable size, and any one of them has a non- vanishing probability of accounting for 
the diameter. Nachmias and Peres [41] have shown that the greatest diameter of any connected component 
of G(n,p) is with high probability 9(n^/^) for p in this range; this result also follows trivially from work of 
Addario-Berry, Broutin, and Reed [1, 3] on the diameter of the minimum spanning tree of a complete graph 
in which each edge e has an independent uniform [0, 1] edge weight. 

In this paper, we demonstrate how Theorem 2 allows us to straightforwardly derive precise results on the 
diameter of G(n,p) for p = 1/n + An~^/^ with A fixed. (In a companion paper [2], we use the limit theory 
developed in this paper to answer many other questions about the distribution of distances between vertices 
in critical random graphs.) 

Theorem 5. Suppose that p = 1/n + An~^/^ for A G M. Let Df denote the diameter of the i-th largest 
connected component of G{n^p). Let = sup^Df denote the diameter of G{n^p) itself For each fixed i 
there is a random variable Di > with E [Di] < oo such that 

^-l/3^n 4 ^.^ 

Furthermore there is a random variable D > with an absolutely continuous distribution and E [D] < oo 
such that n-^l'^D'' ^ D. 

Plan of the paper 

The depth-first procedure is presented in Section 2. Given a connected component of G(n,p) of size m it 
yields a "canonical" spanning tree T^. The distribution of the tree is studied in Section 3. In Section 4, we 
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describe the graphs obtained by adding random surplus edges to the trees and introduce the continuous 
Hmit of connected components conditional on their size. Finally, very much as G{n,p) may be obtained by 
taking a sequence of connected components which are independent given their sizes, the continuum limit of 
G(n,p) can be constructed by first setting the sizes of the components to have the correct distribution, and 
then generating components independently. This is described in Section 5. 

2 Depth-first search and random graphs 

Let G = (V^E) be an undirected graph with V = [n] = {l,...,n}. The ordered depth-first search forest 
for G (started from vertex k) is the spanning forest of G obtained by running depth-first search (DFS) on 
G, using the rule that whenever there is a choice of which vertex to explore, the smallest-labeled vertex is 
always explored first. For clarity, we explain more precisely what we mean by this description, and introduce 
some relevant notation. For i > 0, we define the ordered set (or stack [see 15]) Oi of open vertices at time 
z, and the set Ai of the vertices that have already been explored at time i. We say that a vertex u has been 
seen at time i if u G Oi U Ai. Let q be a counter which keeps track of how many components have been 
discovered up to time i. 

oDFS(G) 

Initialization Set Oq = (1), Ao = 0, co = 1. 

Step i (0 < i < n — 1): Let Vi be the first vertex of Oi and let A/i be the set of neighbors of Vi in 

[n] \ {Ai U Oi). Set Ai-^i = AiU {vi}. Construct Oi+i from Oi by removing Vi from the 
start of Oi and affixing the elements of A/i in increasing order to the start of Oi \{vi}. If 
now Oi+i = 0, add to it the lowest-labeled element of [n] \ Ai-^i and set q+i = q + 1. 
Otherwise, set q+i = q. 

After step n — 1, we have On = 0. We remark that this procedure defines a reordering {vq^ . . . ^Vn-i} of 
[n], and for any G, oDFS(G) always sets vq = 1. We refer to DFS run according to this rule as ordered 
DFS. (The terminology lexicographic- DFS may seem natural; however, this has been given a slightly different 
definition by Cornell and Krueger [16].) We note also that we increment the counter q precisely when Vi is 
the last vertex explored in a component, so that (q,0 < i < n) really does count components. (We observe 
that if, in Step z, we affix the elements of A/i to the end of Oi \ {vi} instead of the start, we obtain the 
breadth-first ordering exploited by Aldous [7]; we will discuss this further in Section 5.) 

The forest corresponding to oDFS(G) consists of all edges xy such that for some i G {0, . . . , n — 1}, x is 
the vertex explored at step i (so x = Vi) and y G Ni. We refer to this as the ordered depth- first search forest 
for G. 

For z = 0, 1, . . . ,n - 1, let X{i) = \Oi \ {vi}\ - (q - 1). The process (X(z),0 < i < n) is called the 
depth-first walk of the graph G. (The terminology "walk" may seem odd here, but in the random context 
which is the focus of this paper, (X(z), < i < n) turns out to be something like a random walk.) 

We will particularly make use of these ideas in the case where G is connected. In that situation, we 
have Q = 1 for all < z < n and so the algorithm is simpler. (In particular, the set of open vertices only 
becomes empty at the end of the procedure.) Furthermore, since G is connected, the ordered depth- first 
search forest is now a tree, which we will refer to as the depth-first tree and write T{G). The depth-first walk 
(X(i), < i < n) now has the simpler representation X {i) = \Oi \ {vi}\ = \Oi\ — 1 for < i < n and can be 
interpreted as the number of vertices seen but not yet fully explored at step i of the oDFS(G) procedure. 
The following observation will be important later: the vertices in Oi \ {vi} all lie at distance 1 from the path 
from the root to Vi. Put differently, the vertices of Oi \ {vi} are all younger siblings of ancestors of Vi. 
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X{s) 




S 



Figure 3. The (interpolated) depth- first walk X{s) of the tree T displayed on the right is shown. We have emphasized 
the integral points that contribute to the area a(T). The portions of the walk above the dashed lines correspond to 
the oDFS(Ti) and oDFS(T2) processes (started from xi and X2 respectively). 

We next consider running oDFS on a tree T = {[n]^ E). Of course, now the depth- first tree will be T 
itself. We define the area of a tree T to be 



The area a{T) corresponds to the number of integral points in {{i^j) '0<i<n^l<j< X{i)} (see 
Figure 3). For convenience, we also define a continuous interpolation: for s e [0, n — 1] 



It will sometimes be convenient to see X as a discrete excursion, and we extend X to all of by setting 
X{s) = for all 5 > n — 1. Say that an edge uv ^ E is permitted by oDFS(T) if, in the oDFS(T) procedure 
run on T, at some stage of the process, i and j are both seen but neither is fully explored: there exists 
zG{0,...,n — 1} such that u^v G Oi. The following lemma is then straightforward. 

Lemma 6. The number of edges permitted by oDFS(T) is precisely a(T). 

Proof. We proceed by induction on n. For n = 1, 2 the claim is clear. For n > 3, let xi, . . . , be the neighbors 
of 1 in T, listed in increasing order, and let Ti, . . . ,Tj be the trees containing xi, . . . respectively, when 
vertex 1 is removed from T. 

By its definition, the procedure oDFS(T) simply uncovers xi,...,^^, then runs oDFS(Tj), for each 
j = in this this order, but started (exceptionally) from xj in each case. In particular, for each 

j = 1, . . . , i, each edge from xj to x G Tj^^ k < j is permitted by oDFS(T). Thus, the total number of edges 
with one endpoint in {xi, . . . permitted by oDFS(T) is precisely 



n-l 



a{T) 



X{s) := X{[s\) + {s -[s\)- {X{[s\ +1)-X{[s\)). 



Write Xt and Xt^. , 1 < j < i in order to distinguish the depth-first walks on T and on its subtrees. By 
induction, it thus follows that the number of edges permitted by oDFS(T) is 
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since, for < /c < \Tj \ and ^ = |Ti | + • • • + \Tj-i \ + /c, the ^-th step of the oDFS(T) process explores a vertex 
Vi of the tree Tj and Xt{^) = i — j -\- {k) (see Figure 3). □ 



The next lemma characterizes the connected graphs G which have a given depth-first tree. This lemma 
essentially appears in [24], though that paper uses slightly different terminology and a different canonical 
vertex ordering for oDFS. The correspondence of Spencer [51] is a precise analog of our lemma when the 
tree extracted from the connected graph is constructed by breadth-first search rather than depth-first search. 
(Spencer used this correspondence to show that the so-called "Wright constants" [53] are essentially factorial 
weightings of the moments of the area of a standard Brownian excursion.) 

Lemma 7. Given any tree T and connected graph G on [n], T{G) = T if and only if G can be obtained from 
T by adding some subset of the edges permitted by oDFS(T). 

Proof. First, if T{G) = T then T is certainly a subgraph of G. Next, suppose that G can be obtained from 
T by adding a subset of the edges permitted by oDFS(T). We proceed by induction on /c, the number of 
edges of G not in T. The case k = is clear, so suppose k > 1 and let ^o, • • • , ^n-i be the ordering of [n] 
obtained by running oDFS(T). Let ViVj be the lexicographically least edge of G not in T (written so that 
i<j)- 

Now, vertex Vi is explored at step i of oDFS(T). By our choice of ViVj^ prior to step i the behavior of 
oDFS(T) and oDFS(G) is identical, so in particular Oi{T) = Oi{G). Furthermore, since ViVj is permitted 
by oDFS(T), and Vj is explored after Vi, we must have vj G OiiT) = Oi{G). Thus, ViVj ^ T{G), and so 
T{G) = T{G \ {viVj}). The "if" part of the lemma follows by induction. 

Finally, suppose that G contains an edge not permitted by oDFS(T), and let ViVj be the lexicographically 
least such edge (in the order given by oDFS(T)). Then as before, the behavior of oDFS(T) and oDFS(G) 
is necessarily identical prior to step z, so in particular Vj ^ Ai{T) = Ai{G). Furthermore, since ViVj is not 
permitted by oDFS(T), vj ^ Oi{T) = Oi{G). But ViVj e E{G), so we will have vj e Ni{G) and thus 
ViVj G T{G). Hence, T{G) ^ T, which proves the "only if" part of the lemma. □ 

Let T[^] denote the set of trees on [n] and write Gt for the set of connected graphs G with T{G) = T. 
Then it follows from Lemmas 6 and 7 that 

is a partition of the connected graphs on [n], and that the size of Gt is 2^^^\ Recall that the surplus of a 
connected graph G is the minimum number of edges that must be removed in order to obtain a tree, and call 
it s{G). Then, for any k G Z+, the number of graphs in Gt with surplus k is precisely i^^^^)- (We interpret 
(^) as if /c > a throughout the paper.) 

We use these ideas to give a method of generating a connected component on a fixed number m of vertices 
(which, without loss of generality, we will label by [m]). Fix p G (0, 1). First pick a labeled tree on [m] in 
such a way that P(T^ = T) ex (1-p)-^^^). Now add to each of the a(T^) edges permitted by oDFS(f^) 
independently with probability p, so that, given T^, we add a Binomial(a(T^),p) number of surplus edges. 
Call the graph thus generated G^. Let G^ be a connected component of G(n,p) conditioned to have size 
(number of vertices) m. 

Proposition 8. For any p G (0, 1) and m <n, G^ has the same distribution as G^, a connected component 
of G{n,p) conditioned to have size m (and vertices labeled by [m]). 

Proof. For a graph G on [m], we write s{G) = \E{G) \ — (m — 1), so if G is connected then s{G) is its surplus. 
Consider G^ as relabeled with [m] in the unique way that conserves the ordering of vertices. It is then 
immediate from the definition of G(n,p) that, for a connected graph G on [m], 

P(G^ = G) oc P(G(m,p) = G) = p^-i+^(^)(l - p)(?)--^+i-^(^) oc p'^^\l - p)-'^^\ 
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Also, by its definition, 



P((5^ = G) oc (1 -p)-^(^¥(G^ = G I T(G) = T) = (i - p)--(^y - p)-m--(^) 



which completes the proof. 



□ 



Corollary 9. Conditional on s(G^) = k > {), G^ is a uniformly chosen connected graph on [m] with 
m -\- k — 1 edges, irrespective of the value of p G (0, 1). 

3 Tilted trees and tilted excursions 

In the introduction, we observed that twice the standard Brownian excursion appears as the limit of the 
height process {H^{i)^0 < i < n — 1) and of the contour process (G"^(t),0 < t < 2{n — 1)) of a uniform 
random tree on [n] (see, e.g., [6, 33]). Let (X^(z), < i < n — 1) be the corresponding depth-first walk. Then 



as n ^ oo, with convergence in D([0, 1],M+) equipped with the Skorohod topology (see Marckert and 
Mokkadem [39]). (Note that the results in [39] are stated in the more general situation of an ordered 
Galton-Watson tree with an arbitrary finite-variance offspring distribution, conditioned to have n vertices. 
If the offspring distribution is taken to be Poisson mean 1 then the conditioned tree has precisely the metric 
structure of a uniform labeled tree.) It is no coincidence that the limits of these three processes should be 
the same: they are not only the same in distribution, but are actually the same excursion. 

Theorem 10 (Marckert and Mokkadem [39]). As n ^ oo, 



We will make considerable use of this fact. An essential tool in what follows will be the following estimate 
on the distance between the depth- first walk and the height process (Theorem 3 of [39]). 

Theorem 11 (Marckert and Mokkadem [39]). For any u > 0, there exist Uy and 7 > such that, for all 
n> rijj, 



In this section, we focus on understanding the distribution of the tilted trees T^. Note that in the case of 
critical G{n^p) the largest components have size m of order n^/^ and p ^ 1/n, so that we shall take p = p{m) 
of order m-^/^ We write = (X^(z), <i <m) and H"^ = (^^(z), < i < n) for the depth-first walk 
and height process of (in oDFS order). Although it is usually impossible to reconstruct the labelling 
from either or the structure of the trees (as unlabeled rooted ordered trees) can be recovered from 
either or H^. We start with a description of the scaling limit of these discrete excursions, which is 
closely related to the scaling limit of the corresponding processes, X^ and i^^, for uniform trees. 

Write £ for the space of excursions; that is, 

£ = {f e G(R+, R+) : /(O) = 0, 3 cr < 00 such that f{x) > V x G (0, a) and f{x) = Vx > a}. (2) 

Given a function / G G([0,cr],R+) with /(O) = f{a) = and f{x) > V x G (0,cr), we will abuse notation 
by identifying / with the function g ^ £ which has g{x) = /(x), < x < a and g{x) = 0, x > a. The 



^(X^([ntj),0 < t<l)^ (e(t),0 < t < 1), 



1 



(X-( Ln -J ), H^iln -J , G-(n •)) - (e( • ), 2e( • ), 2e( • /2)). 
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distance of interest for us on £ is given by the supremum norm: for a function / G C(]R^,M), we write 
11/11 = supg>o 1/(5)1. Let e^^^ = (e'^^^(5),0 < s < a) be a Brownian excursion of length a > 0. We omit the 
superscript in the case of a standard Brownian excursion (e(5),0 < 5 < 1). Note that, by Brownian scaling, 
we have 

(e^^^O < s<cr) = (Va- e(s/cr),0 <s<cr). 
Then the area under the excursion e^^^ is 



A{a):= I e^''\s)ds = (j^^^ C e{s)ds. 
Jo Jo 

The random variable A{2) has the so-called Airy distribution. This distribution has a rather complicated 
form but, for our purposes, it will suffice to note that its Laplace transform, ^ : C ^ C given by (j){z) = 
E[exp(— z e{x)dx)]^ is an entire function (see Janson [26] for details); in particular, it is finite for z = — 1. 
For cr > 0, we define the tilted excursion of length a, e*^^^ = {e^^\s),0 < s < a) G f , to be an excursion 
whose distribution is characterized by 

p . r.) ^ E[l{e(^)eg}exp(/;e(-)(s)d5)] 

\ ^""y E[exp(/;e(-)(s)ds)] ' ^' 

for S C f a Borel set. Here, the Borel sigma- algebra on S is that generated by open sets in the supremum 
norm || • ||. (Equation (3) gives a well-defined distribution since </>(—!) < 00.) As with the standard Brownian 
excursion, we will omit the superscript whenever the length of the tilted excursion is 1. As previously, let 
D([0,cr],R+) be the space of non-negative cadlag paths on [0,cr], equipped with the Skorohod topology. 

Theorem 12. Suppose that p = p(m) is such that mp^^^ a as m ^ 00. Then, as m ^ 00, 
((m/a)-^/'X^([(m/a)tJ),0 < t < a) 4 (e(")(t),0 < t < a). 



The proof consists in transferring known limits for uniform random trees over to tilted trees. We must first 
ensure that the change of measure defined by P(T^ = T) ex (1 -p)-«(^) is well-behaved when p = 0{m-^^^). 
To do so, we shall in fact first derive tail bounds on the maximum of the depth-first walk. (Khorunzhiy and 
Marckert [31] have proved similar bounds for the maxima of Dyck paths, which are essentially the contour 
processes of Catalan trees.) Let be a uniformly random tree on [m], and let be the associated 
depth-first walk. 

Lemma 13. There exist constants C >0 and a > such that for all m G Z+ and all x >0, 

P(||X^|| > xvm) < Ce-"^'. 

Proof. To prove the lemma we use a connection with a queueing process which is essentially due to Borel 
[13]. Consider a queue with Poisson rate 1 arrivals and constant service time, started at time zero with a 
single customer in the queue. We may form a rooted tree (rooted at the first customer) associated with the 
queue process, run until the first time there are no customers in the queue, in the following manner: if a 
new customer joins the queue at time t, he is joined to the customer being served at time t. We denote 
the resulting rooted tree by Then ^ is distributed as a Poisson(l) Galton-Watson tree and, hence, 
conditional on its size being m, as Tm (viewed as an unlabeled tree) [13]. Viewing the arrivals as given 
by a Poisson process Q of intensity 1 on R+, we may also associate an interpolated random walk to the 
process, by St = \Qn [0,t)| — for t G M+. Then is precisely the first time t that 5*^ = —1, i.e., that 
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|Q n [0, t)| = t — 1. Furthermore, {/St, t = 1, 2, . . . , |^| — 1} is distributed precisely as the depth-first walk of 
(It is not equal to the depth-first walk, but to the breadth- first walk discussed in Section 5.) 
Using the above facts, we may thus generate ^ conditional upon |^| = m as follows. First let 
Ui^ . . . ^Um-i be independent and uniformly distributed on [0,m], and let U = {/7i, . . . , /7^_i}, so that 
U is distributed as |Q H [0,m)| conditional on |Q H [0,m)| = m — 1. Next, let ja G {l,...,m} minimize 
lU n [0, — /i, and apply a cyclic rotation by —fi to all the points in U to obtain W . In other words, let 

U' = {{Ui — jii) modm, . . . , {Um-i — modm}. 

Then m is precisely the first time t that D [0,t)| = t — 1, and W is distributed precisely as |Q H [0,m)| 
conditional on = m (see [19]; this type of "rotation argument" was introduced in [9]). 

Now write for the depth-first walk of T^, and let = H [0, i)| — i for z = 1, . . . , m. By the above, 
{Xf^, . . . , X^_^, —1} and {Xi, . . . , X^} are identically distributed. In particular, 

IIX^II^ max X, 

0<z<m 

= max (|ZYn [0,z)| -z) - min {\U H [0,i)\ - i) - 1 

0<i<m 0<i<m 

max (IZY n [0,i)| - i) + max (IZY n [z,m)| - (m - i)) 

0<i<m 0<i<m 

< sup {\Un[0,t)\-t)^ sup {\Un[t,m)\- {m-t)). 

0<t<m 0<t<m 

The two suprema in the latter equation are identically distributed, and so 

P(||X^|| >2x) <2P( sup {\U n[0, t)\-t)>xY (4) 

\0<t<m J 

For any fixed t G [0, m], let Pt be the event that there is a point of U at t. For fixed x > and t G [0, m], 
Et^x be the event that \U [0, t]| = + but that H [0, s)| < s -\- x for all < s < t (so in particular, 
there is a point at t). We then have 

^{Et,x \ \un[o,t]\ = \t^x],Pt) = ¥ {\u n[o,s)\ < s ^ X \/ < s < t \ \u n[o,t)\ = \t ^ x] - 1) . 

Applying the ballot theorem for stochastic processes to the latter probability (see, e.g., [52] or p. 218 of [29]), 
we obtain the bound 

V{Et,, \ \Un[0,t]\=\t + x],Pt)<l - ^^ + ^1-^ <1. (5) 

t -\- X t 

Furthermore, in order for {supQ<^<^(|Z// H [0,t)| — t) > x} to occur, Ef^^ must occur for some < t < m. 
Since an infinitesimal interval [t^t + dt] contains a point of U with probability dt{m — l)/m, and for each t, 
\V{ n [0,t)| is distributed as Bin(m — l,t/m), it follows from (4) and (5) that 



P(||X-|| >2x)<2 ¥{Et,x)dt 
Jo 

<2 / -F{\un[o,t)\ = \t^x]) ^ U t 

Jo t m 

- Jo t 

where the last inequality follows from Chernoff 's bound for Binomial random variables (see, for example, 
[28]). The conclusion follows easily for x < m/2, and thus for all x (since we always have ||X^|| < m). □ 
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Lemma 14. There exist universal constants > such that the following holds. Fix c > and > 0, 
and suppose that p G (0, 1) and m G Z+ are such that p < cttt'^I^ . Let Tm he a uniform random tree on [m]. 
Then 

Proof. Fix c and ^ as above, and let A = 2c^. We may clearly restrict our attention to m sufficiently large 
that p < cm~^^'^ < 1/2. For all such m we have 

so it suffices to prove that sup^>]^ some universal constants K and But 

a(T^) < m||X^||, and so 

P(m-^/2a(T^) >x)< P(m-i/2||X^|| > x). 
Since ||X^|| < m, it follows by Lemma 13 that 



E 



< 



/ e^^P(m-i/2||X™|| > x)dx < 
Jo Jo 



,1/2 



(6) 



Completing the square in the last integrand so as to express the right-hand side as a Gaussian integral yields 
the claim with = a~'^ and K = C^yTr/a. □ 

Proof of Theorem 12. We assume a = 1 for notational simplicity; the general result follows by Brownian 
scaling. Again, let (X^(z),0 < z < m) be the depth- first walk associated with a uniformly-chosen labeled 
tree. Its area is a(T^) = YlT=Q -^^(0- We know from Theorem 10 that 



(m-^/2X^([mtJ),0 <t<l)^ (e(t),0 < t < 1). 



(7) 



We will henceforth want to think of as a function in D([0, 1],]R+) and will write X'^{s) instead of 
m-^/'^X'^{[ms\). For h e D([0, 1],M+), let I{h) = h{t)dt] / is a continuous functional of the path h. It 
then follows from (7) that 

-I rn-l ^1 ^1 

m-^^^a{T^) = ^-^y^X^{i)= X^{t)dt^ e{t)dt, 
^ ' ^ Jo io 

jointly with the convergence in distribution of the depth- first walk. 

Now suppose that / : B([0, 1],M+) R+ is any bounded continuous function. Then 



E 



/ (m-^/2X-(Lm-J)) 



E 



/(X^)(l - p)-^'^' ^0 ^^{t)dt 



E 



and 



E[/(e)] = 



E 



(1 _^)-m3/2/^iX-(t)dt 

/(e) exp (^jl e{x)dx 



E 



exp 



[lo e{x)dx^ 



Since X^{t)dt -i e{t)dt and p ^ m we also have 



exp 



e{t)dt . 



By Lemma 14 the above sequence is uniformly integrable, and so we can deduce that E[/(m [m • J )] 



E[/(e)], which implies that {m-^^'^X'^{lmt\),0 < t < 1) ^ e. 



□ 



13 



Unsurprisingly, as in the case of uniform trees, we also have convergence of the height process of tilted 
trees towards tilted excursions. 

Theorem 15. Suppose that p = p{m) is such that mp^^^ a as m ^ oo. Then, as m ^ oo, 
((m/(7)-^/2^^([(m/a)sJ),0 < s < a) ^ {2e^''Hs),0 < s < a) 

mD([0,cr],] 



The theorem follows straightforwardly from the following lemma and Lemma 14, much as Theorem 12 
folllowed from Lemma 14; its proof is omitted. 

Lemma 16. Suppose that p = p{m) is such that mp^^'^ a as m ^ oo. For m > 1 let be a tree on 
[m] sampled according to the distribution IP(T^ = T) cx (1 — Let and be the associated 

depth-first walk and height process. Then, there are constants K and tuq > 0, such that for all m> ttiq. 

P(||X^ - ^^/2|| > m^/^) < Km-^^^^. 

Proof Let be a tree on [m] chosen uniformly at random, and write and for its depth- first walk 
and height process. Then, by definition. 



E[l{ll^._^./2||>^3/8}(l-p)--(^-)] 



E[(l-p)-«(^-)] 

Distinguishing between the trees with a "large" area, a{Tm) > m^^/^^, and the others, we obtain 

E[l{a(T^)>^25/i6|(l P(||X- - H^/2\\ > m3/8)(l -p)— ''''' 



P(||X^-^^/2|| >m^/^) < 



E[(l-p)-«(^-)] 



E[(l-p)-«(^-)] 



.1/8 ^1/16 



E[(l-p)-«(^-)]^ 



where the second inequality follows from the bound P(||X^ — H'^/2\\ > m^/^) < e ^'^^'^ ^ for some 7 > 
and all m large enough, which is obtained from Theorem 11. By Markov's inequality. 



P(||X^ - ^^/2|| > m^l^) < E[m-^/2^(f^)] • m"^/^^ ■ 



-7(l+o(l))mi/« 



E[(l-p)-«(^-)]' 

Finally, by Theorem 12 and Theorem 10 together with Lemma 14 we have 



E 



E 



e^''\s)ds 



< 00 and E 



{1-p) 



-a(T^) 



E 



exp 



e^''\s)ds 



as m ^ 00. 



>0, 
□ 



4 The limit of connected components 

4.1 Generating connected components of the random graph 

In this section, we discuss two different ways of constructing a connected graph from a tree. 

Bijective encoding of connected graphs. Consider a connected labeled graph G on m vertices, and recall 
that running the oDFS process on G produces the depth- first tree T{G). Recall that the edges permitted 
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by oDFS are those between vertices which both he in the stack Oi for some i. By Lemma 7, G can be 
recovered from T = T(G) by adding some specific subset of the permitted edges. 

By Lemma 6, we may consider permitted edges to be in bijective correspondence with the integral points 
(i, j) lying above the x-axis and under the depth-first walk X = 1 < i < m) encoding T. So suppose 

that we take the depth-first walk X and place a mark at the point (i, j) if there is an edge in G between 
Vi, the i-th vertex explored in oDFS order (that is, the first vertex of O^), and the vertex lying in position 
\Oi \ — j -\- 1 in Oi (equivalently, the j-th vertex of Oi when counting from the end (or bottom) of the stack). 
Call the resulting object a marked depth-first walk. Then clearly this gives a bijection between marked 
depth-first walks and connected graphs G. 

For a tree T with depth-first walk X and a pointset Q C Z+ x Z+, let G^ = G^{T, Q) be the graph 
obtained by adding to T the edges corresponding to the points in Q H X where, for convenience, we define 

5 n / := {(x, y) eS:{)<y< f{x)} for ah 5 C R+ x R+ and / : R+ ^ R+. (8) 

A Binomial pointset of intensity p is random subset of Z+ x Z+ in which each point is present independently 
with probability p. The following lemma follows straightforwardly from Proposition 8. 

Lemma 17. Letp = p G (0, 1). LetT^ he a tree on [m] sampled in such a way that¥{T^ = T) cx (l—p)~^^^\ 
Let (X(z),0 < i < m) he the associated depth-first walk. Let C Z+ x Z+ he a Binomial pointset with 
intensity p, independent ofT^. Then, the graph G^(T^, Q^) has the same distrihution as G^, a connected 
component of G{n^p), n>m, conditioned to have m vertices. 

It is convenient to devise a similar construction which uses the height process in place of the depth-first 
walk. Although this second construction does not provide a bijection between graphs and "marked height 
processes", it turns out to be closely related to the first construction, and useful to us in the asymptotics 
which follow. 

Generating connected graphs from height processes. Consider a tree T on [m] whose height process 
(here and below, by "height process" we always mean "height process in oDFS order") is given by {H{i)^ < 
i < m). Given a discrete pointset Q C Z+ x Z+, we define a graph G^ = G^ {T^ Q) on [m] using the height 
process as follows. Take the tree T and for (z, j) G Q with < 2j < H{i), add an edge between the vertex 
Vi and the vertex at distance (2j — 1) from the root '^o on the unique path to Vi. (As will be clarified below, 
the factor of two here is essentially the factor of two difference between the limits of the depth-first walk 
and the height process in the case of random trees; c.f. Theorem 10 and Lemma 16.) Note that upon adding 
these edges, we may produce a graph G^ such that T(G^) ^ T. Moreover, the procedure G^{T^ •) does 
not provide a bijection, since it is not onto (one of the endpoints of every edge added is at odd distance from 
the root). 

See Figure 4 for an example of the two constructions. 

The graphs G^(T^, Q^) and G^(T^, Q^) turn out to be very similar, and G^ {T^^ Q^) is much easier to 
analyze than G^ {T^^, Qp). In particular, the limiting object corresponding to G^ {T^^ Q^) is very natural. 
Write ^ for Lebesgue measure on R+ x R+. 

Lemma 18. Let p = p{m) he such that mp^^^ cr as m ^ oo. Pick a laheled tree on [m] in such a 
way that P(f^ = T) cx (1 - p)-«(^) and let H"^ he the associated height process. Let C Z+ x Z+ he a 
Binomial pointset of intensity p. Let Vm = {{{rri/a)~^i^ {rri/a)~^^'^j) : (i^j) G Q^}. Then 

(((m/a)-i/2^^(L(m/(7)tJ),0 < t < a),Vm H {{m/a)-^/^H'^{[{m/a)t\)/2,0 < t < a)) ^ {2e^^\V n e^^^) 

as n ^ oo, where V is a homogeneous Poisson point process with intensity measure ^ on R+ x R+^ and V 
is independent of e^^\ Convergence in the first co-ordinate is in D([0, <j], R+)^ and in the second co-ordinate 
is in the sense of the Hausdorff distance. 
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X{t) 




Figure 4. Part of the depth- first walk of a tree T is shown with the pointset Q = {(i, 2)}. In the walk picture, the 
boxes represent the nodes in Oi. In the tree T, the black vertices have been fully explored. The two graphs (T, Q) 
and (T, Q) resulting from the correspondences with marked depth- first walk X and height process respectively, 
are presented: on the left, G^ (T, Q) is obtained from T by joining Vi and the second vertex from the end of d; on 
the right, G^ {T, Q) is obtained by joining Vi and the vertex at distance 3 from the root on the to Vi in T. 



Proof. We assume for notational simplicity that a = 1; the result for general a follows by Brownian scaling. 
(Note that the point process is rescaled independently of a.) Let A: > 1 and let Ai, ^2, . . . , C [0, 1] x R+ 
be disjoint measurable sets. Then for any n > 1 and any 1 < i < the discrete counting function 

NM) = #{(LmxJ, Lm^/'^J) e : {x,y) G A,} 

is a Binomial random variable with parameters r]rn{Ai) = #{([mxj, [m^^'^y\) : (x, G ^i,0 < [m^^'^y\ < 
lmx\ } and p. Since 

and m^^'^p ^ 1 as m ^ oo, Nm{Ai) -i Poisson(^(74i)) for 1 < i < k. Moreover, the random variables 
Nm{Ai) ^ Nm{A2) , . . . , Nm{Ak) are independent, since they count the points of in disjoint sets. Thus 
T^m V in distribution [17, 32]. 

Suppose now that for each m > 1, /m ^ [0, 1] ^ is a continuous function, and that fm converges 
uniformly to some function / : [0, 1] M^. Then for any open set A C [0, 1] x R+, {{x^y) G A : < y < 
fm{x)} {{x^y) e A : < y < f{x)} (in the sense of the Hausdorff distance). It follows that Vm ^ fm ^ 
Vnf in distribution, since the Poisson process almost surely puts no points in the set {{x^y) e A : y = f{x)}. 

Now suppose that ' {0,1,..., m} Z+ and (m-^/2^^( [mtj ), < t < 1) ^ {g{t),0 < t < 1) 
in D([0, 1],M+), where g is continuous. Then letting gm : [0,1] be the continuous interpolation of 

{m~^^'^ gm{mt) : t = 0, m~^, 2m~^, . . . , 1), we also have gm ^ 9 uniformly. Moreover, 

Vn n (m-i/2^^([mtj),0 < t<l)=Vm^~gm 

since the functions agree at lattice points. So we obtain that Vm H (m~^/^^rn(L^^J)7 0<t<l)-i7^n^. 

Finally, since Vm and H"^ are independent, and (m-^/2^^([mtJ), < t < I) ^ (2e(t),0 < t < 1) in 
distribution by Theorem 15, it follows easily that 

Vm n (m-^/2^^([mtJ)/2, < t < 1) 4 7^ n e, 
jointly with the convergence of the height process. □ 
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4.2 The limit object 

For p = p{n) = 1/n + An~^/^, A G M, any single one of the largest components of the random graph G{n^p) 
has (random) size m = m{n) ~ av?!'^ [7, 11, 28]. Conditioned on having size m, such a connected component 
has the same distribution as G^(T^, Q^). As we have already remarked, Lemma 16 shows that and 
H'^ /2 are close for large m. Suppose now that we can prove that, for large m, {Tf^, Q^) and {T^^ Q^) 
are also "close" in an appropriate sense. Lemma 18 suggests that G^ {T^^ Q^) should have a non-trivial limit 
in distribution when distances are rescaled by n~^/^, which should, thus, also be the limit in distribution of 
similarly rescaled. We will now define this limit object, Ai^^\ by analogy with G^ {T^^ Q^)- 

Let T be the real tree encoded by a height process /i, as described in the introduction. Recall that we 
think of T as a metric space, rooted at a vertex we call p. Recall that r denotes the canonical projection 
r : [0, cr] ^ T . Let Q be a pointset in x R+ such that there are only finitely many points in any compact 
set. To each point ^ = {i^ i^^) G Q H there corresponds a (unique) vertex r(^^) G T of height h{(,^). 

Now let t(^^,^^) be the vertex at distance 2^^ from p on the path [[p, t(^^)]]. Define a new "glued" metric 
space g{h, Q) by identifying the vertices r(^^) and t(^^,^^) in T, for each point ^ G Q fl and taking 

the obvious induced metric. 

Now let 7^ be a Poisson point process with intensity measure ^ on x R+, independent of e^^\ a tilted 
excursion of length a. Note that V almost surely only has finitely many points in any compact set. Then 
we define the random metric space Ai^^"^ = g{2e^^\V) and write Ai = M.^^\ 

In Theorem 22 below, we will see that A^^^^ is indeed the scaling limit of a connected component of G(n,p) 
conditioned to have size m ~ av?!'^ . In order to do this, we show that the metric spaces corresponding to 
G^(T^, OF) and G^(T^, QF) are very likely close in the Gromov-Hausdorff distance for large m, and that 
the metric space corresponding to G^(T^, Q^) and MS^^ are also close in the Gromov-Hausdorff distance 
for large m ~ av?!'^ . 

The scaling limit of the whole of G{n^p) is then be a collection of such continuous random components 
with random sizes; the components of size o(n^/^) rescale to trivial continuous limits. The proof of this is 
dealt with in Section 5. 

It is possible to give a more intuitively appealing description of M.^^\ Take a continuum random tree 
with tilted distribution and independently select a random number P of its leaves, each picked with density 
proportional to its height. For each of these leaves there is a unique path to the root of the tree. Independently 
for each of the selected leaves, pick a point uniformly along the path and identify the leaf and the selected 
point. Before we move on, we justify this description. 

Proposition 19. Using the same notation as above, the following statements hold. 

1. Given e^^\ encoding the real tree T , for any £,y ) eVr\ eS"'^, has density 

/;e(-)(s)d5 

on [0,cr]. Moreover, the vertex r{^^) is almost surely a leaf of T. 

2. We have df{p^f{^^,^y)) := 2^^ = Udf{p^r{^^)) , where U is a uniform random variable on [0^1], inde- 
pendent of and e^^^ . 

3. Given e^^\ the number of vertex identifications in M^^^ has a Poisson distribution with mean e^^\u)du. 

Proof. It is useful to keep in mind that there are two (independent) sources of randomness: one which gives 
rise to the excursion e^^^ (and hence the tree T), and another which gives rise to the point process V. 

Given e^^^ and that (^^, ^^) is a point of 7^ H e^^\ it is straightforward to see that has claimed density 
on [0, cr]. Hence, 
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Recall that ja is the natural measure induced on T from Lebesgue measure on [0, a]. Since the distribution of 
e'^^^ is absolutely continuous with respect to that of 6*^^^ a Brownian excursion of length a, and /i(>C(T)) = a 
[5, p. 60], we must also have ii{C(T)) = a. It follows that 

nr{n e >c(t) 1 1) = 1 

for almost all T. Hence, r(^^) is almost surely a leaf. 

Again using the fact that (^^,^^) is a point of 7^ fl e^^\ given e^^^ and is uniformly distributed in 

[0, e'^^^(^^)]. The second statement follows since df{p^T{^^)) = 2e^^\^^). The third statement is immediate 
from the fact that the number of points in 7^ H e^^^ has a Poisson distribution with mean e^^\u)du. □ 

4.3 Bounding the dissimilarity between and 

In this section, we provide deterministic bounds on the distances 

^G//(G^(Ti, Qi), G^(Ti, Qi)) and dcHigihu Qi),^(/^2, Q2)) 

for fixed labeled trees Ti and T2, fixed height processes hi and /12 and pointsets Qi and Q2. These bounds 
are used in Section 4.4 to obtain the scaling limit of connected components G^. 

Consider the depth-first walk (X(z),0 < i < m) and the height process {H{i)^0 < z < m) of a (deter- 
ministic) tree T on [m], together with a finite discrete point set Q C Z+ x Z+. The two excursions and the 
pointset give rise to two graphs := G^{T^ Q) and := G^{T, Q). We will, for the moment, view G^ 
and G^ as metric spaces on {0, 1, . . . , m — 1}. We do this in the following way: first relabel the vertices 
of T with {0, 1, . . . , m — 1} in depth-first order. Then, in each case, add the relevant surplus edges and, 
finally, take the metrics to be the graph distances d^ {■^■) and d^ {-^ •) induced by G^ and G^ respectively. 
We will abuse notation by continuing to refer to these metric spaces as G^ and G^ . Note that we have 
given an injection of the two graphs/metric spaces into a common set of points V and so we can measure 
the distortion 

sup \d^{x,y) - d^{x,y)\ 

(with respect to this fixed injection) between them. This distortion provides an upper bound on the Gromov- 
Hausdorff distance (see, e.g., [33]). 

Lemma 20. Suppose that QnX = Qn {H/2), and write k = \Qn X\ = \Qn {H/2)\. Then 

dGH{G'',G'')<k{\\X-H/2\\^2). 

Proof. For the present proof, we will think of a path in a graph as an ordered list of vertices (which may 
contain repeats) which is such that there is an edge present between each pair of adjacent vertices. We will 
write for the operation of concatenation on ordered lists so that, for example, (1, 2, 3)0(2, 4) = (1, 2, 3, 2, 4). 
Fix two vertices x and and consider a shortest path 7r^(x, y) between x and y in G^ . Let (ui^Wi), < i < £ 
be the sequence of endpoints of the surplus edges in 7r^(x, y) in the order in which they appear when going 
from X to y. Then we have 

£-1 

7T^{x,y) = 7Tt{x,Ui) ^ (^{Ui.Wi) ®7TT{Wi,Ui^i)^ {Ui.Wi) ®7TT{Wi,y), 
i=l 

where 7Tt{u^v) denotes the unique shortest path between u and v in the tree T. We will now define a path 
7r^(x, y) in G^ between the nodes x and y. The path 7r^(x, y) will not necessarily be a shortest path from x 
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Figure 5. Left: a graph and the shortest path between vertices x^y. Right: the corresponding path (which is 
no longer a shortest path) between x and y in the graph . In both graphs, the shortcut edges are bold, and the 
remaining edges of the paths are dashed. 



to y in G^, but it will go through the vertices Ui^Wi^l < i < i in the order in which they appear in 7r^(x, y). 
(The relation between the paths 7r^{x,y) and 7r^{x^y) is depicted in Figure 5.) In particular, let 

£-1 

7r^(x, y) = TTrix, ui) © ^ (^7r^'^(iXi, Wi) © irriwi, iXi+i)) © 7r^'^(ii^, Wi) © TTriwe, y), 
where 7r^'^('Ui, i^i), 1 < i < ^, are paths that we will define in a moment. We clearly have 

\n"{x,y)\ - |7r^(x,y)| = ^(|7r^'^(^.i,«;0l " !)■ (9) 

i=l 

For I < i < Ui and Wi are joined in by an edge corresponding to a point G Q H X. By assumption, 
G Q n {H/2) so also induces an edge fi in and, by definition, at least one of Ui and Wi must be 
an endpoint of fi. Assume that fi joins Ui to a vertex wf (an analogous argument works if fi is instead 
attached to Wi). Then we define tt^'^ (ui^Wi) to be the path that consists of the edge (ui^wf) followed by 
the shortest path 7rT{wf ^Wi) between wf and Wi in T. Then, we have 

|^^'^(^i„^,)l - 1 = \7rT{wf,w,)\. (10) 

Now, by definition, wf lies on the path TTrip^Ui) in T between Ui and the root of T. Since when Ui is the 
first element of the oDFS stack, the other elements are all at distance 1 from the path from the root to Ui^ 
Wi has a neighbor wl on that same path (see Figure 4). It follows that \7rT{wf , Wi)\ < \H{wi) — H{wf) \ + 1. 
The edges {ui, Wi} in G^ and {ui^ wf} in G^ correspond to the same point G Q, so H{wf) is also twice 
the index of Wi (in reverse order) in the set Oi. This number is exactly 2X{wi) + 2 since the nodes in Oi 
which precede Wi in oDFS order are exactly those that are still to explore when we arrive at Wi^ and there 
are X{wi) of them. Then, 

\7TT{wf,Wi)\ < \H{mi)-2X{wi)\^3< \H{wi) - 2X{wi)\ ^ 4 < ||i^-2X||+4. 

Using this bound together with (9) and (10), we finally obtain that 

d'^ix, y) - d'^ix, y) < 21{\\X - H/2\\ + 2) < 2k{\\X - H/2\\ + 2). 

A symmetric argument shows that d^ {x^ y) — d^ {x^ y) < 2k{\\X — H/2\\-\-2). The claim then follows since the 
Gromov-Hausdorff distance is bounded by the distortion, i.e., dcHiG^ ^ G^) < \ sup^^ |(i^(x, y) — d^ {x, y)\ 
(see, for example, [33]). □ 
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When the pointsets used to define two graphs are not identical, we measure their dissimilarity with the 
Hausdorff distance. Let Ti and 7^ be two real trees encoded by the height processes (/ii(5),0 < s < a) and 
(/i2(5),0 < s < (j) respectively. Let ri : [0,cr] T\ and T2 : [0,cr] ^ 72 be the canonical projections, as 
discussed in the introduction, and let p\ = ti(0), p2 = ^2(0). Write d\ and d2 for the pseudometrics on [0, a] 
given by 

= + - 2 inf /ii(r) 

sf\t<r<s\Jt 

d2{s,t) = h2 is) + h2 {t) - 2 inf /12 (r) , 

sAt<r<sVt 

and write dr^ and (irs for the metrics induced on Ti and 72 respectively. Suppose that Qi and Q2 are two 
pointsets in R+ x R+. Let d = ^(/ii, Qi) and G2 = ^(/i2, Q2). 

Lemma 21. Suppose that k = \Qi D {hi/2)\ = IQ2 H (/i2/2)| an^i S = dniQi H (/ii/2), Q2 D {h2/2)). Then, 
dGH{GuG2)<l{k^l)l6^12\\hi-h2\\^4 sup \h2{r) - h2{r')\] . 

Z y \r-r'\<S J 

Proof. The bound is again obtained by estimating the distortion induced by the natural correspondence 
1Z = {{ri{s),r2{s)) : < 5 < cr} between the vertices of Ti and 7^. For G [0,cr], let 7r^{s,t) be a shortest 
path between ri{s) and ri{t) in Gi (we consider this path to be an ordered continuous set of vertices). The 
argument is now roughly the same as that of Lemma 20: we construct a path 7r^(5,t) from r2{s) to r2{t) 
that is not too much longer than 7r^(5,t), by "splitting 7T^{s^t) up at the shortcuts". 

Parts of 7r^(s,t) follow geodesies in the real tree 7i, and translating these to 72 poses no challenge not 
met in Lemma 20. The difficulty arises when 7r^(5,t) uses shortcuts produced by the vertex identifications 
induced by Qi H {hi/ 2). Of course, these shortcuts are not present in G2, which has its own shortcuts. Recall 
that |Qin(/ii/2)| = |Q2n(/i2/2)| = k, let ^1,^2, ... ,6 be the points of Qin(/ii/2), and let r/1,7/2, . . . , 7?/c be 
the points of Q2 H (/12/2). Since dn^Qi H (/ii/2), Q2 fl (/12/2)) = (5, by relabeling if necessary, we can arrange 
that sup;L<^<j^ ll^i ~ ^ building 7r^(5,t), we will then use the shortcut induced by rii whenever the 

shortcut induced by is used in 7r^(s,t). 

In order to formalize this strategy, we need to give a sense to the notion of the direction in which we 
traverse a shortcut. Let denote the geodesic from uio v inTi or 72, where which tree we mean will 

be obvious from context. Let be the concatenation operator. Suppose that 7r^(5,t) uses ^ shortcuts, and 
assume without loss of generality that we have labeled the points of Qi H {hi/ 2) so that the shortcuts along 
the path 7r^(s,t) are induced by = (^f,^^) for 1 < z < In each such identification, one of the vertices is 
Ti(^f) and the other is Ti(^f,^f) (which is the vertex at distance 2^f from pi along the path [[pi, Ti(^f )]]). 
For each i G {1,...,^}, there exists (at least one) ^ [^i^] such that Ti(Ci) = 'ri(^f,^f); pick one such 
arbitrarily. 

Now 7r-^(5,^f) is a sub-path of T:^{s^t)\ it is precisely the portion of 7r-^(s,t) connecting ti{s) to ri(^f). 
Furthermore, the puUback of 7r^(5,^f ) to Ti will consist of i geodesies plus a single isolated point, which will 
be either ri(^f) or ri(Ci). We say ri{(,f) is followed by ri(Ci) in 7r^{s^t) if ri(Ci) is the isolated point, and 
otherwise say that ti{Q) is followed by Ti(^f) in 7r^{s^t). 

If Ti(^f) is followed by Ti(Ci) in 7T^{s^t) then let Si = and ti = ("i- If, on the other hand, Ti(Ci) is 
followed by ri(^f ), let Si = Q and ti = ^f. Then 

7r\s,t) = [[ri{s),ri{si)[[ © (^0 [[ri(t,), ri(.,+i)[[^ © [[ri{U).ri{t)]]. 

We now construct a path 7r^(s,t) between r2{s) and r2{t) in G2. For each vertex identification in G2, 
one of the vertices is r2{r]f); the other is r2{r]f,r]^) = r2(7i) for some 7^ (where r2{r]f^r]^) is the vertex at 
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distance 2rf( from the root on the path [[p2, T2(7^f )]]). If Si = then let Si = rjf and ti = 7^; if Si = d then 
let Si = and ii = r]f. Then the vertex identifications in G2 are {r2{si),r2{ii)). 
Now set 

n\s,t) = [[r2{s)Msi)[[ © (^0 [[r2{s,)Mm © [[r2ft), r2(t,)[[ [[r2(t.), r^C^.+i) [[j 

© [[r2{se),r2{se)[[® [[r2(t^), r2(t^)[[ © [[r2(t^), r2(t)]]. 

The difference in length of 7T^{s^t) and 7r^(s,t) may be bounded as follows: 

i-i 

|7r^(5,t)| - \7r^{s,t)\ < \di{s,si) - d2{s,Si)\ + ^ (^i, s^+i) - 6^2(^2,5^+1)! 

i=l 
i 

^\di{te,t) - d2{te,t)\ + ^ (^^2(5^, 5^ + 6^2(^2, • (H) 

We bound the first three terms on the right-hand side together. Recalling the definition of the distances di 
and (^2, we obtain that for any r, r' G [0, a], 

\di{r,r')-d2{r,r')\<4\\hi-h2l 

Hence, the first three terms in (11) are together bounded above by 4(£ + l)||^i — ^2||- We now turn to the 
last term in (11). We have 

d2{s^,Si) ^ d2{U,U) = d2{if,Vf) ^ d2{Cinz). (12) 
for 1 < i < Since — ^ we have — r]f\ < S and so we get 

d2i^f,vD = h2m + h2ivf)-2 inf h2ir) < 2 sup |/i2(r) - /i2(r')|. (13) 

<r<^f Vryf \r-r'\<6 

Now consider (i2(Ci,7i)- If hi^f , ^i),r2{Vi ^Vi) ^ [[^2{^f),r2{r]f)]] then we straightforwardly have 

^2(0,7.)<^2(ef,/?f)<2 sup \h2{r)-h2{r% (14) 

|r— r' I <(5 

If, on the other hand, at least one of r2(^f , ), f2(7^f , r^f ) is not in [[T2(^f ), r2(7^f )]], then both must be on 
the path from the root p2 to one of r2(^f ) or r2{r]f). Then 

d2{Ci,7i) = \^!-v!\<5. (15) 

It follows from equations (12) to (15) that 

y2{d2{s,,Si)^d2{U,U)) <£(s^4 sup \h2{r) - h2{r')\] . 
i=i \ \---'\<^ J 

Since i < putting the different parts of (11) together, we get 



\7r\s,t)\-\7r\s,t)\<4{k^l)\\hi-h2\\^kls^4 sup \h2{r) - h2{r')\ ] . 

\ \r-r'\<5 J 

An identical argument provides a bound on |7r-^(5,t)| — |7r^(5,t)| and, since sup|^_^/|<^ |/ii(r) — hi{r')\ < 
2\\hi - /12II + sup|^_^/|<5 \h2{r) - h2{r^)\, we obtain 



dGH{GuG2)<l{k^l)[S^12\\h,-h2\\^4 sup |/i2(r)-/i2(rOI 1 , 

Z y \r-r'\<6 J 

which completes the proof. □ 
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4.4 Gromov— HausdorfF convergence of connected components 

We are now almost ready to prove the convergence of G^, a connected component of G(n,p) conditioned to 
have size m, to a continuum random graph. For a suitable continuous embedding of G^, we will prove the 
following theorem. 

Theorem 22. Suppose that a > 0. Let m = m(n) G Z+ he a sequence of integers such that n~'^^^m a 
as n ^ oo, and let p = p{n) G (0, 1) be such that pn ^ 1. Let be a connected component of G{n^p) 
conditioned on having size m. Then, as n ^ oo, 

in the Gromov-Hausdorff distance. 

As mentioned before, the proof uses crucially the connection between G^ {T}^, Q^), which has the same 
distribution as G^, and G^(T^, Q^), for which the limit object appears naturally. 

We now find it convenient to think of G^(T^, Q^) as a metric space with a continuous embedding. This 
is done in the obvious way by viewing each edge of the graph as a line segment of length 1, with the natural 
distance. Write m~^/^G^(T^, Q^) for this metric space with all edges rescaled by m~^/^. We will begin by 
proving the following lemma. 

Lemma 23. Suppose that a > 0. Let p = p(m) G (0, 1) be such that mp^^^ a. Then as m ^ oo, 
in the Gromov-Hausdorff distance. 

Proof. We again assume for simplicity that cr = 1, the general case following by Brownian scaling. We wish 
to use Lemma 21. However, somewhat inconveniently, m~^/^G^(T^, Q^) is not quite the same as the real 
graph g{H^, Q^), where is the continuous interpolation of {m~^^'^H'^{mt) : t = 0,m~^,2m~^, . . . , 1). 
The reasons for this are twofold. Firstly, the real tree which is the natural continuous embedding of m~^/^T^ 
is not the real tree coded by H^^ but is rather the real tree coded by (m~^/^G^(2mt), < t < 1), where 
G^ is the contour process of the tree T^. Secondly, in ^( • , •) the points of the point set induce vertex 
identifications, rather than the insertion of an edge of length m~^/^ between the vertices concerned. Neither 
of these facts will pose a problem, but they make the line of reasoning somewhat less clear. 

We will first deal with approximating m~^/^G^(T^, Q^) by g{h^ Q) for a suitable function h and a 
suitable pointset Q. Write G^ for the rescaled version of G^, i.e., 

C!^{t) := m-^/2G^(2mt), < t < L 

Then this function encodes the correct real tree. However, if we use the points Vm = {{rri~^i^m~^^'^j) : 
(i, j) G Q^}, we identify the wrong vertices. In order to get the correct vertices, we define 

K{i) = 2i- H'^ii), 0<i<m-l. 

Then we have that the contour process travels from the {i + l)-st vertex visited in depth-first order to the 
{i + 2)-nd in the interval [K{i)^K{i + 1)] (see Section 2.4 of Duquesne and Le Gall [18] for a proof). In 
particular, G^(i^(i)) = ^^(z). Now let 

Qrn = {((2m)-^K(z),m-^/^j) : (ij) G < j < H^{i)}. 

Then Qm gives the right set of vertex identifications to make. So we will approximate m~^/^G^(T^, Q^) by 
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We now need to deal with the fact that, although g{C'^ ^ Qm) has the correct underlying tree and identifies 
the correct vertices, we actually insert edges (of length m~^/^) in m~^/^G^(T^, Q^). Suppose there are k 
vertex identifications. Take the correspondence between the vertices of the metric spaces g{C^^Qm) and 
m~^/^G^(T^, QP) given by the fact that they have the same underlying (real) tree. Now, m~^/^G^(T^, Q^) 
has line segments inserted between vertices which are identified in g{C^ ^ Qm)- In the correspondence, map all 
of the elements of one of these segments in m~^^'^G^ (T^^ Q^) to the single (identified) vertex in g{Cl^^ Qm)- 
Then the distortion of this correspondence is at most km~^^'^ ^ since the distance between any two vertices 
in m~^^'^G^ (T^^ Q^) differs by at most km~^^'^ from the distance between the corresponding vertices in 
^/(Cr.2m). It follows that 

^G//(m-^/^G^(f^,Q^),^((7r,Qm)) < \m-^^^\Vm^{m-^^^H^{[m-\/2)\. (16) 

Recall that M = g(2e^V). We next bound dGH{g{CJ^ ^ Qm)^ gi'^e^V)). In order to do this using 
Lemma 21, we need an upper bound on the Hausdorff distance between and V He. We have 

dHiQm.Vne) < d^(Q^,7^^n(m-i/2^-(Lm-J)/2))+d^(7^^n(m-i/2^-(Lm-J)/2),7^ne). (17) 

There is a one-to-one correspondence between the vertices of Qm and those of Vm H (m~^/^J^^([m • J)/2); 
indeed, by definition, each point {m~^i^m~^^'^j) G Vm H {m~^^'^H^{lm • \) /2) corresponds to a point 
((2m)-ii^(i),m-i/2j) G Qm- But then 



dH{Qm.Vmn{m-'/^H^{lm-\)/2)) < - sup 

^ 0<i<m 



K{i) 



- sup \H^{i)\. (18) 

^ 0<i<m 



2 

By Lemma 18, Theorem 10 and Skorohod's representation theorem, there is a probability space where 

((m-^/2^^([mtJ),0 < t < 1), (C7^(t),0 < t < l),Vm H (m-^/2^^( [mtj )/2, < t<l))^ {2e,2e,V D e) 

almost surely. Let e > 0. Then, there exists an almost surely finite random variable Y such that for all 
m>Y, the following hold: 

• \Vmn{m-^/^H'^{lm-\)/2)\ = \Vne\, 

• dH{Vmn{m-^/^H^{lm-\)/2)),Vne)<e, 

• supo<,<i|Cr(t)-2e(t)| <e, and 

• supo<^<^ I^'^IOI < ^e. 

Note that Qm = Qm H {CJ^/2) since the points contained in Qm are already below {CJ^/2). Then, by 
Lemma 21, (17) and (18), for ah m>Y, 

^G//(^(Gr,Qm),^(2e,7^))<i(|7^ne| + l)(l4e + 8 sup \~e{s) - ~e{t)\Y 

^ \ \s-t\<2e J 

(We remark that Lemma 21 is applied with 5 = 2e, not S = e. This is essentially because of the factor 2 
rescaling of the point sets that occurs prior to vertex identification.) By (16), we then get that 

dGH{m-'/'G''{f^, Qn.gi^e, V)) < + m-^/^)\V H e| + 1) (Me + 8 sup \~e{s) - e(t)| ) (19) 

^ V \s-t\<2e J 

for all m> Y . 
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Now note that, by definition, tiie excursion measure associated witii a tilted excursion e is absolutely 
continuous with respect to Ito's excursion measure. It follows by Levy's Modulus of Continuity Theorem 
(see, for example, [46, 48]) that a tilted excursion e satisfies 



sup \e{s)-e{t)\ >e^/^ 

.\s-t\<2e J 







(20) 



as e ^ 0. Furthermore, given e, |P H e| has a Poisson distribution with mean l^e(^s)ds^ and so 

^(\V n e| > e-^/^) <^[\V^~e\> e'^'^ e{s)ds < e"^/^^ ) ^ ^ (/^ ^^^^"^^ - ^"'^'"^ 



/ e{t)dt 
Jo 



A/16 



by Markov's inequality. Since e(i)dt has finite expectation and Y < oo almost surely, we obtain from (19) 
and (20) that 

^G//(m-^/2G^(f^,Q^),^(2e,7^)) ^0. (21) 



The result follows. 



□ 



Proof of Theorem 22. Once again, we treat the case cr = 1, which implies the general result. By Lemma 17, 
is distributed as G^(T^, Q^). Recah from Lemma 18 that and H"^ denote the depth-first walk and 
height process of T^, respectively. In order to apply the result of Lemma 23, it will be convenient to be 
working with almost sure convergence instead of weak convergence. By Lemmas 16 and 18 and Skorohod's 
representation theorem, there is a probability space in which m-^/2||X^ - H'^/2\\ almost surely and 

( ( H'^iWnU) \ /C^(2mt) \ /x^(MJ) \ 



(2e, 2e,e,Pne), 



_ /i^^(MJ) 



almost surely. Recall that Vm = {{m ^i^m ^l^f) : (^, j) ^ Q^}. Then the size of the symmetric difference 

(Q^nx^)A(Q^n(^^/2)) 

is stochastically dominated by a Binomial random variable with parameters [m||X^ — ^^/2||] and m~^/^. 
Since m-i/2||X^ - ^^/2|| ^ 0, 



(Q^ n X"^)A(Q^ n (^"^/2))| ^ 0. 



(22) 



Since this random variable is integer- valued, it follows that P(Q^ D = H {H^ /2)) ^ 1 as m ^ oo. 
Then, by Lemma 20, on the event {Qp fl = fl {H"^ /2)} we have 

dGH{G''{fl, Qn.G^'iTl, QP)) < \QP n X-| • (||X- - H^/2\\ + 2). 

So, for any e > 0, 

< P(||X™ -iJ"'/2|| > em^/*^ -2) +P(|Qf nX™| > m^^^) +r{QP n X"" ^ QP(1{H"'/2)) 

< (1 + o(l))i^e-^/^m-^/i6 + m-^^^E[\QP D X"*!] + P(Qf n X™ 7^ Qf n (^™/2)), 
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by Lemma 16 and Markov's inequality. Since E[|Q^nX^|] ^E[\Vn e\] = E[/q e{s)ds] < oo, the right-hand 
side tends to zero. Finahy, 

and so from (21) we obtain that dcHim-^^^G^ {fp^, Qp),M) ^ 0. Since is distributed as {fp^, Qp) 
and m~^/^n^/^ 1, the result follows. □ 

We finish this section by stating an easy corollary on the number of surplus edges of G^. 

Corollary 24. Suppose that m = m{n) is such that n~'^^^m ^ a as n ^ oo and let p = p{n) be such that 
pn ^ 1. Then as n ^ oo, 

s{GPJ ^ Poisson (^j^ e^''\u)di^ . 
Proof. This follows immediately from the observation that 

s(G^) = IQP nX™| = n ((m/a)-i/2x™([(m/cT)tJ)/2,0 < t < a)\, 
and from Lemma 16, (22) and Proposition 19. □ 

5 The limit of the critical random graph 

Recall that we are interested in G{n^p) with p = -\- An~^/^, for A G M. We will begin by recalling some 
more details of Aldous' limit result from [7]. His principal tool is the so-called breadth-first walk on G{n^p). 
This is very similar to our depth-first walk, except that the vertices are considered in a different order. The 
breadth-first ordering vq^vi^. . . ^Vn-i on the vertices of the graph is obtained as follows. (We deliberately 
use the same notation as in our definition of the depth-first ordering.) For z > 0, we define the ordered set 
Oi of open vertices at time z, and the set Ai of the vertices that have already been explored at time i. We 
say that a vertex u has been seen at time i if u e Oi U Ai. Let q be a counter which keeps track of how 
many components we have looked at so far. 

Initialization Set Oo = (1), Ao = 0, cq = 1. 

Step i {0 < i < n — 1): Let Vi be the first vertex of Oi and let A/^ be the set of neighbors of Vi 

in [n] \ {Ai U Oi). Set Ai-\-i = AiU {vi}. Construct Oi+i from Oi by removing Vi from 
the start of Oi and affixing the elements of Afi in increasing order to the end of Oi \{vi}. 
If now Oi+i = 0, add to it the lowest-labeled element of [n] \ Ai-\-i and set q+i = q + 1. 
Otherwise, set q+i = q. 

The only difference between this procedure and the one introduced in Section 2 is that the word "start" has 
been changed to "end" (italicized above). Now define Yn{i) = \Oi \ {vi}\ — (q — 1). Then (y^(z),0 < i < n) 
is called the breadth-first walk on the graph. It is straightforward to see that {Yn{i)i^ < i < n) attains a 
new minimum every time that Vi is the root of a new component. This enables us to interpret component 
sizes as excursions above past minima of the breadth-first walk. Aldous proved that 

n-^'\Y^{\n'"h\),t > 0) ^ {W^{t),t> 0) (23) 
as n ^ OO in D(R+,R+), with convergence (as is usual) uniform on compact time-intervals. Here, 

W\t) = W{t) + \t-^^, 
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where is a standard Brownian motion. It is not hard to see that the breadth-first and depth-first walks 
are interchangeable here, and that the identical result holds for the depth-first walk. Since we do not actually 
need this result, we will not go further into details here. 
Now define 

B^{t) = W^{t) - min W^{s), 

0<s<t 

the reflecting process. The excursions of this process correspond to "components" of the limiting graph. As 
stated by Aldous, there is an inhomogeneous excursion measure associated with this B^^ in the same way as 
Ito's excursion measure is associated to a reflecting Brownian motion. Recall from (2) that S is the space of 
continuous excursions of finite length. 

Denote Ito's measure by N and the excursion measure associated to by (i.e. for excursions starting 
at time t). Then, as argued by Aldous in the proof of his Lemma 26, we can calculate the density of with 
respect to N. For clarity, we will repeat his argument here. Firstly note that = Nq~* and so it suffices to 
find for all A G M. 

Write W = {W{t)^ < t < a) for the canonical process under N. Then by the Cameron-Mart in-Girsanov 
formula [46, 49], applied under N, 



By integration by parts, we have 



exp 



(^l\x-s)dW{s)-^l\\-sfds]. 



/ (A - s)dW{s) = / W{s)ds 
Jo Jo 

the area under the excursion W. So we can re- write 



dn^ 
In 



exp 



W{s)ds - -{{a - \f + \')) 





We know that we can define a normalized excursion measure N( • |cr = x) for each x > 0, which is, in fact, a 
probability measure. There is a corresponding probability measure Nq ( • |cr = x) which, for B C S a Borel 
set, is determined by 



N[exp {J^W{s)ds) ls|cr = , 



N[exp {J^W{s)ds) \cT = x] 
Note that this quantity is independent of A. By Brownian scaling. 



N 



exp 



N 



exp 



x^/^ / W{s)ds 



a = 1 



E 



exp X 



.3/2 



e{s)ds 



where (e(5),0 < s < 1) is a standard Brownian excursion under E. Similarly, for any suitable test function 
/ of the excursion. 



f^o[f{W{s),0 < S < X)\CT = x] 



E 



f{^e{s/x), < 5 < 1) exp ^x^/^ e{s)dt 



E 



exp 



Putting all of this together, we see that the inhomogenity of the excursion measure lies entirely in the 
selection of the length of the excursion. So to give a complete description, we just need to determine 
N^(cr G dx). We know that N(cr G dx) = (27i)~^^'^x~^^'^dx and so 



N^(cr G dx) = (27r)-^/^x-^/^ 



exp 



((x-A)^ + A^) N 



exp 



W{s)ds 



a = X 



dx. 
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To recapitulate: the excursion measure at time t picks an excursion length according to Nq~*((J g dx). Then, 
given cr = X, it picks a tilted Brownian excursion of that length. This is the crucial fact that allows us to 
use Theorem 22 and the results of Section 4 about the limit of connected components. It is not surprising 
that it holds, however, since the components of G{n^p) likewise have the property that one can first sample 
the size and then, given the size, sample a connected component of that size. 

Let = (Cf,C2,...) be the components of the random graph G{n,p) with p = 1/n + An~^/^, in 
decreasing order of their sizes, > Z2 > . . . respectively. Let = (Zf , Z2 , . . .)• As a consequence of 
(23), Aldous [7] proves that 

4 (24) 

where Z is the ordered sequence of excursion lengths of and convergence is in i'^. Let = (Mf , . . .) 
be the sequence of metric spaces corresponding to these components. Recall the definition of A^^^^ from the 
start of Section 4.2. We next state a more precise version of Theorem 2. 

Theorem 25. As n 00, 

(n-2/^Z^,n-^/^M^) 4 (Z,M), 
where M = (Mi, M2, . . .) is a sequence of metric spaces such that, conditional on Z, Mi, M2, . . . are inde- 
pendent and Mi = M^^'\ Convergence in the second co-ordinate here is in the metric specified by (1). 

In proving Theorem 25, we need one additional result, on the expected height of the tilted trees 
introduced in Section 2. This lemma is essentially what allows us to use the distance (1), rather than 
product convergence. 

Lemma 26. Letp = l/n + An~^/^. There exists a universal constant M > such that for all n large enough 
that l/(2n) < p < 2/n and p < 1/2, and all 1 < m < n, 

< M • max(mS-^ 1) • 

Before we proceed with the proof, note that the bound in Lemma 26 tells us that tilted trees of size of 
order n^/^ behave more or less like uniform trees. (See the moments of the height ||T^|| of uniform trees 
in [22, 23, 45].) Trees of size much larger than n^/^ are much more infiuenced by the tilting (as witnessed 
by the factor m^n~^). 

Proof. We assume throughout that m > 2. For any x > and a > 0, we have 

P(||T^|| > xm^/2) < P(||T^|| > xm^/' | a(T^) < ax^m^^^) + P(a(T^) > ax^m^^^). (25) 

We will bound each of the terms on the right-hand side of (25) then integrate over x to obtain the desired 
bound on E[||T^||^]. (We will optimize our choice of a later in the proof.) The intuition is that when a(T^) 
is not too large, the distribution of is not too different from that of the uniformly random labeled rooted 
tree T^, and so we should be able to use pre-existing bounds on the tails of ||T^||. On the other hand, we 
have already proved (c.f. Lemma 14) bounds that will allow us to control the probability that a(T^) is large. 
We now turn to the details. 

Let q = max(m~^/^,p). By Markov's inequality and the definition of we have 



< 



E[((l-p)(l-g))-°(^"-)] 

E [(1 - 
(1-9)- 



< -T7 2 3/2 • (26) 
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Let c = 2m^/^/n, so that cm ^/^/4 < p < cm and observe that q < 5m for 5 := max(c, 1). By 
Lemma 14, there exist absolute constants > such that 

supE[(l-g)-2«(^-^)] <Ke^^^\ 

m>l 

Furthermore, since qm^^'^ ^5/4, (26) yields 

P(a(T^) > ax^m^^^) < i^e^-^'-"^'^/^ < i^e-^^'^/^ (27) 



for ah X such that x'^ > 32K.S/a. For x > y^S ln{2K)/{aS), so that e "^/^ < 1/2, since p < cm and 
(1 — p)~-^/^ < e^, we also have 

<2e2^"^' -PdlT^II >xm^/2). (28) 

We can now use tail bounds on the height of uniform labeled trees. Luczak [36, Corollary 1] provides a 
uniform tail bound on ||T^||: for some universal constant K\ and all integers m > 1, 

P(||T^|| > xm^/') < i^Ve-^'/^ 

and so taking = 8^, (28) yields 

P(||T^|| > I a(T^) < ax^m^^^) < 2K' x^e^^^''-^^^^''' < 2K'x^e-^'/\ (29) 

Notice that our requirements that x'^ > 32K,S/a and x'^ > 8 ln(2i^)/(aJ) now reduce to x > Wtv^^'^S and 
X > 8Y^ln(2i^). So, in particular, setting L = 16ai:^/^ + 8Y^ln(2i^), and recalling that 5 = max(c, 1), it 
suffices to require that x > LS. 

For all such x, combining (27) and (29) with (25) and substituting in the value of a then yields 

P(||f^|| >xm^/2) <e-^'/^^ + 2i^Ve-^'/^ <max(4i^V,2)e-^'/^^ 

Writing E [X] = F {X > t) dt, we then have 

< / P(||T^||^ > xm^)dx < m^L^S^^m^ / max(4i^V/^ 2)e-^'^'/^^dx. 

Jo Jl^5^ 

So, since 8 = max(c, 1) with c = 2m^/^/n, we have E[||T^||^] < m^M(max(m^/^/n, 1))^, for some absolute 
constant M > 0, as required. □ 

Proof of Theorem 25. In the random graph G{n,p)^ conditional on Z"^, the components Mf,M2^,... are 
independent and 

where as above, p = -\- An~^/^. Note that ^ 1 as n ^ oo. By (24) and Skorohod's representation 
theorem, there exists a probability space and random variables Z"^,M"^,n > 1 and Z,M defined on that 
space such that (Z^,M^) = (Z^,M^), n > 1, and (Z,M) = (Z,M) with n-^/^Z^ ^ Z a.s. But then the 
convergence (n~^/^Z^, n~^/^M^) -i (Z,M) in the product topology follows immediately from Theorem 22. 
We can, and will hereafter assume, again by applying Skorohod's theorem, that (n~^/^Zf , n~^/^M^^) 
{Zi, Mi) almost surely for all i. It remains to prove convergence in distribution in the metric specified by (1). 



28 



In doing so, we will need to use the oDFS procedure. For any n and i for which MJ^ is defined, we may view 
M-^ as a finite connected graph; this graph is uniquely specified (up to isomorphism) by M^. When we write 
oDFS (MJ^) we mean the oDFS procedure run on a uniformly random labelling of the graph corresponding 
to Ml". 

To prove convergence in the metric specified by (1), we first observe that for any sequences of metric 
spaces A, B and any integer > 1, we have 

/N-l \l/4 xl/4 

^(A,B) < f ^dGH(A„5,n + i^dGM.B.fj . 

Since we have already established convergence in the product topology, to complete the proof it thus suffices 
to show that for all e > 0, 

lim limsupP I VdGH(n"^/^Mf,M^)'^ > e I =0. (30) 

N^oo n^oo J 

As earlier, we write || • || for the height of a rooted tree or the supremum of a finite excursion. For any i and 
n, we may bound use the bound 

6^GH(n-^/^Mf,M,)^ < 16(n-^/^||Tf + ||e(^^)||4), (31) 

where is the depth-first tree corresponding to oDFS(M[^) started at its smallest vertex, and e*^^^^ is the 
excursion corresponding to M^. Now let 

By Brownian scaling, given the length Z^, we have that \\e^^^^ \\^ = Zf'\\ei\\^^ where {e^, z > 1} are independent 
and identically distributed copies of e, a tilted excursion of length one, and which are independent of {Z^, i > 
1}. Combining the preceding equalities with (31), we thus have 



4 / 2 

i=N i=N ^ 



Next, given S > write Ns = N§{Z) for the smallest TV such that < S; N is almost surely finite since Z 
is almost surely an element of i?^. For any S > and all n, setting ei = e/16 we then have 

p(EdGH(n-i/3Mr,Mi)4>eJ <p( ^ (sf (n-^^'Z^f + Zf\\e,\A > ei J + P(iV, > TV) . 

\i=N I \i>Ns ^ ' ) 

Since limAr^ooIP(-^5 > A'') = 0, and the first probability on the right-hand side of the preceding inequahty 
does not depend on A?^, we thus have 

hm hmsupP ( VdcH fn-i/3Mf,Mi)'' > e I < hmsupP ( V fs>-'*/3(Zf )2 + ZfHe^f ) >ei |. 

Since this holds for any 5 > and the left-hand side does not depend on 5, we then obtain 

2 \ \ 



lim limsupP I V dcH fn-^/3M",Miy > e | < limlimsupP ( V ( 5" f4W^ 



^flleiirj >eij. 

(32) 
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But it follows from Corollary 2 and Lemma 14 (b) of [7] that for all 7 > 0, 

2 \ / 



limlimsupP >^ 

SiO ^ ^^^^ \ ^ 



i>Ns 



> 7 1 = 0, and limP ^ > 7 = 0, 



from which we may complete the proof straightforwardly. First recall that e^, i > 1 are independent and 
identically distributed tilted excursions of length one. Moreover, E [HeiH"^] < 00, using the change of measure 
in the definition of and the Gaussian tails for the maximum ||e|| of a standard Brownian excursion e [30]. 
Let 62 = ei/2, and choose 6 > small enough that 



> 



\i>Ns 



2E[||er] 



^2 
< — . 

- 2 



Then, by Markov's inequality, we have 



Ki>Ns 



E^ 



E^^ 



< 



i>Ns 



2E[\\i 



- 2 €2 



E^: 

i>Ns 



E^l 

i>Ns 



< 



2E[||e||4] 



since {e^, i > 1} is independent of the set of lengths {Zi^i > 1}. Since e = 32e2 was arbitrary, it follows that 



limi 

610 



\i>Ns J 



We may apply an identical argument to bound the terms involving discrete random variables and show that 



limlimsupP f V (Zj'n-'^^^Y > 62] = 0, 
.io > J 



and thereby complete the proof, as long as we can show that sup^^E[S^] < 00. We now prove that this 
is true. Recall that, by definition, = ||T/^||^/(ZJ^)^. Recall also that for a given n and z, conditional on 
= the tree T-^ is distributed as from Section 2. In other words, we have 

for each tree T on [m], where by T-^ = T we mean that the increasing map (i.e. respecting the increasing 
order of the labels) between vertices of and T induces an isomorphism. 

To bound E[S^], we use Lemma 26, together with bounds on the size of the largest component of G(n,p) 
for p= l/n + An-'^/^ For our purposes, the latter bounds are most usefully stated by Nachmias and Peres 
[42] (see also [44, 50]). They proved that for ah fixed A G M, there exist 7 = 7(A) > and C = C(A) > 1 
such that for all n and for all x > C, 
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(In fact, the bound in [42] is slightly stronger than this.) For all integer i > 1, we thus have 

n 

E[sa< sup m-'E[\\fn\'\Z^ = m]+ ^ m-'E[\\fn' \ = m] -FiZ^ = m) 

— m= I Cn^/"^ I 

n 

< sup m-'E[\\f^f]+ rn-'mf^f]-nz^>m) 

— rn=\Cn^/'^\ 



< sup m-'E[\\fU']+ E m-2E[||fP||4].e- 

— rn= I Cn^/-^ \ 



Applyling Lemma 26 to the last expression, we immediately obtain 

n 

which is uniformly bounded in both i > 1 and n > 1, as required. □ 

We finally turn to the diameter of the random graph G{n,p). Note that, in order to prove convergence 
in the metric (1), we in fact proved that for all e > 0, 

lim limsuppf y n-^/^||T;^||^ > e I =0 and lim P| V ||e^^^^ H"^ > e | =0. 

\i>N I \i>N J 

Since diam(M;^) < 2||f;^|| and diam(Mi) < 2||e(^^)||, we must thus have 

lim lim sup P sup n -^^^diam{M^)^ > e = and lim P sup diam(Mi)'^ > e = 0, (33) 

N^oo n^oo \i>N J N^oo \i>N J 

which allows us to prove the convergence of the diameters. For i = 1,2,... let Df^ = diam(MJ^) if has 
at least i components, and Df = otherwise, and let 

L)^ = max{diam(Mf)} 

i>l 

denote the diameter of G{n,p). We remark that Aldous discusses the diameter of continuum metric spaces 
and the Brownian CRT in particular [5, Section 3.4]. We can now prove Theorem 5 stated in the introduction, 
which claims that n~^^^D^ —> for a random variable D > with finite mean and absolutely continuous 
distribution. 

Proof of Theorem 5. For each z = 1, 2, . . . let I)^ = diam(M^), and let D = sup{diam(M^) : i > 1}. Observe 
that for any two metric spaces M and |diam(M) — diam(7V) | < 2dGu{M^ N). For fixed i, the claimed 
convergence is immediate from Theorem 25 since 

|n-^/^diam(Mf ) - diam(M^) | < 2dGH(n"^/^Mf , 

and n~^^'^M^ Mi in distribution in the Gromov-Hausdorff distance. Also diam(M^) is a non-negative 
random variable with finite mean since diam(M^) is at most the diameter of the underlying continuum 
random tree: diam(Mi) < 2\\ei\\y/Zi^ where is a tilted excursion of length one. It follows immediately that 
for any fixed 

n-^l'^ max{D,^, 1 < i < TV} 4 max{A, 1 < i < A^}. (34) 
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Next, observe that for any e > 0, there exists c > such that F {D < c) < P (I^i < c) < e/2. Since 
T^-i/S/^n _^ must then have that, for all n large enough, 

P(n-^/^D^ < c) < Fin-^^^D"^ < c) < e/2. 

Now fix any e > and take N large enough that for all n sufficiently large, 

P ( sup n"^/^diam(Mf ) > c] < e/2 and P ( sup diam(MO > c] < e/2; 

\i>A^ / \i>N J 

such an must exist by (33). Then for all n sufficiently large, by the preceding equations 

¥[ D"" max Df]<e and ¥ ( D max A I < e. 

y i<i<Ar J ~ \ l<i<N J ~ 

Since e > was arbitrary, combining these inequalities with (34) yields that n~^^^D^ as required. It 

follows straightforwardly from the behavior of the tail of the sequence {Di^i > 1) that D has an absolutely 
continuous distribution. Finally, the fact that E [I)] < oo is a direct consequence of [3], Theorem 1. □ 

6 Concluding remarks 

In this paper we have proved that it is possible to define a scaling limit for critical random graphs using 
random continuum metric spaces. This gives us a systematic way to consider a great many questions about 
distances in critical random graphs. In particular, it allows us to prove that critical random graphs have a 
diameter of order n^/^ which is not concentrated around its mean. Focussing just on the diameter, there are 
now several questions which might deserve another look: what is the probability that the largest component 
achieves the diameter? What is the distribution of the (random) point A G M (where p = 1/n + An~^/^) at 
which the diameter of the random graph maximized? What is the distribution of this diameter? 

The proof of our main result relies on a careful analysis of a depth-first exploration process of the graph 
which yields a "canonical" spanning forest and a way to add surplus edges according to the appropriate 
distribution. The forest is made of non-uniform trees that are biased in favor of those with a large area. In the 
limit, these trees rescale to continuum random trees encoded by tilted Brownian excursions. We have limited 
our analysis of these excursions to a minimum, but it seems likely that much more can be said, which might 
in turn yield results for the structure of the graphs or the behavior of other graph exploration algorithms. 

In this paper, we have very much relied upon the depth-first viewpoint. Gregory Miermont [40] has 
suggested that, at least intuitively, there should be an analogous breadth-first approach to the study of 
a limiting component, in which one might think of the shortcuts as being made "horizontally" across a 
generation rather than "vertically" along paths to the root. The advantage of the depth-first walk is that it 
converges to the same excursion as the height process of the depth-first tree. The rescaled breadth-first walk, 
however, converges to the same limit as the rescaled height profile (i.e. the number of vertices at each height) 
of a "breadth-first tree", which contains less information and, in particular, does not code the structure of 
that tree. As a result, it seems that it would be much harder to derive a metric space construction of a 
limiting component using the breadth-first viewpoint. It may, nonetheless, be the case that the breadth-first 
perspective is better adapted to answering certain questions about the limiting components where only the 
profile matters. 

In a companion paper [2], we will describe an alternative construction of the limit object which has 
the cycle structure of connected components at its heart: a connected component may be described as a 
multigraph (which gives the cycles), onto which trees are pasted. Together with the results of this paper, 
the latter perspective yields many limiting distributional results about sizes and lengths in critical random 
graphs. 
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